Reply to Comment on "Thermal Model for Adaptive Competition
in a Market": In our Letter [1] we introduced a generalization of the
minority game (MG) [2], called the thermal minority game (TMG). One of
its main new features is that the agents make stochastic decisions,
controlled by a temperature $T$. In the completely deterministic case
$T = 0$, the original MG is recovered.
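
As an illustration of temperature-controlled decision making, the sketch below samples one of an agent's strategies from Boltzmann weights. The Boltzmann form, the names `strategy_probabilities` and `choose_strategy`, and the `points` scores are assumptions made for this example and are not taken from the text above. The only property it is meant to show is that $T = 0$ reduces to the deterministic choice of the best-scoring strategy.

```python
import math
import random

def strategy_probabilities(points, temperature):
    """Strategy-use probabilities from scores (assumed Boltzmann form)."""
    if temperature == 0:
        # Deterministic limit: all weight on the best-scoring strategy.
        best = max(range(len(points)), key=lambda a: points[a])
        return [1.0 if a == best else 0.0 for a in range(len(points))]
    top = max(points)  # subtract the maximum for numerical stability
    weights = [math.exp((p - top) / temperature) for p in points]
    total = sum(weights)
    return [w / total for w in weights]

def choose_strategy(points, temperature, rng=random):
    """Sample a strategy index according to the probabilities above."""
    probs = strategy_probabilities(points, temperature)
    return rng.choices(range(len(points)), weights=probs, k=1)[0]
```

With two strategies scored `[1.0, 3.0]`, `choose_strategy` always returns index 1 at `temperature=0`, while at finite temperature the lower-scoring strategy is also played with non-zero probability.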

In their Comment [3] Challet et al. claim that: (i) the equations of
our model reduce to Eqs. (2) of [3]; (ii) Eqs. (2) of [3] lead to the
"exact solution of the MG"; (iii) the crossover to a random value of
the variance for $T \gg 1$ found in Figs. 2-3 of [1] is due to short
waiting times in the simulations. Remarks (i) and (ii) are incorrect.
Point (iii) is true, and highlights even more the crucial role of the
temperature in the TMG. The central claim of [3] is that the effects of
the temperature in the TMG "can be eliminated by time rescaling" and
consequently the behaviour of the TMG is "independent of T". These
statements have no general validity.

Challet et al. obtained Eqs. (2) of [3] by averaging our equations
both over the noise $\eta$ and over the individual strategy
distribution. Averaging over $\eta$ is legitimate, since it preserves
the full macroscopic dynamics for all $d$ and $T$. However, averaging
over the individual strategies as in [3] is too naive and misses
important correlated fluctuations. Besides, this procedure is
conceptually wrong: replacing $R^*$ by its average is equivalent to
allowing the agents (who have $s$ fixed strategies available) to play
with any strategy formed by a linear combination of the $s$ fixed ones.
This is not sensible and is contrary to the original spirit of the
model.
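
The point about averaging can be made concrete with a toy agent. The sketch below is only an illustration under assumed values (two orthogonal strategies played with equal probability; none of the numbers or function names come from the text). It shows that the averaged strategy is a linear combination that is not one of the fixed strategies, and that the average of the squared norm of the played strategy differs from the squared norm of the averaged strategy, which is the kind of fluctuation term the averaging discards.

```python
def dot(u, v):
    """Euclidean scalar product of two equal-length vectors."""
    return sum(x * y for x, y in zip(u, v))

def mean_strategy(strategies, probs):
    """Probability-weighted average of the fixed strategies."""
    dim = len(strategies[0])
    return [sum(p * r[k] for p, r in zip(probs, strategies))
            for k in range(dim)]

def mean_square_norm(strategies, probs):
    """Average of R.R over the strategy actually played."""
    return sum(p * dot(r, r) for p, r in zip(probs, strategies))

# Hypothetical agent: s = 2 fixed strategies, used with equal probability.
strategies = [[1.0, 0.0], [0.0, 1.0]]
probs = [0.5, 0.5]

avg = mean_strategy(strategies, probs)            # not one of the fixed strategies
with_fluctuations = mean_square_norm(strategies, probs)
averaged_first = dot(avg, avg)
```

Here `avg` is `[0.5, 0.5]`, `with_fluctuations` is `1.0` and `averaged_first` is `0.5`: the two orders of averaging disagree whenever the agent's choice is genuinely stochastic.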

In Fig. 1 we demonstrate the above assertions by comparing the
results of simulations of the TMG with those resulting from the
equations of Challet et al. Not surprisingly, for $T \approx 0$ the
approximation leading to Eqs. (2) of [3] works well (at $T = 0$ there
is no stochasticity and the average is equal to the best strategy).
However, as soon as we turn on $T$, this approximation fails to
reproduce the correct behaviour. Clearly, Eqs. (2) of [3] miss
fluctuation effects and do not describe the behaviour of our system
for any $T \neq 0$.

In [4] Challet et al. approximate the r.h.s. of Eqs. (2) of [3] as
the gradient of an effective Hamiltonian $\mathcal{H}$ and study the MG
by minimizing $\mathcal{H}$. This procedure is predicated on the
assumption of equilibration of the strategy-use probabilities
$\pi_i^a(t)$. This assumption is false for $d < d_c$ [5]. The
consequence is that although their method correctly accounts for the
phase $d > d_c$, it fails completely for $d < d_c$ (see Fig. 1 of [4]).
Thus, to claim, as Challet et al. do in (ii), that they have found the
"exact solution of the MG" is incorrect and misleading.

Remark (iii) of the Comment is correct: the crossover to a random
variance for large $T$ observed in [1] is due to finite simulation
times. The time required to reach the steady state for $T > 1$ is of
order $NT$. This is a very interesting observation. It means that in
the phase $d < d_c$, for any finite temperature larger than a critical
value $T_c \sim 1$ [1], the performance of the system will be better
than that of the original MG, provided that we wait long enough. In
the inset of Fig. 1 we show $\sigma$ vs. $T$ for waiting times much
larger than $NT$, for the values of $d$ of Fig. 2 of [1].

In other words, what Challet et al. have correctly pointed out in
their Comment is that any degree of stochasticity above a given
threshold makes the TMG perform better than the MG. The claim that
"the collective behaviour should be independent of T" is clearly
wrong. The remarkable structure of the TMG with the temperature begs
for further investigation.
[Figure 1. Horizontal axis: $d = D/N$.]

FIG. 1. $\sigma$ vs. $d$ in the TMG (• $T = 0$, ■ $T = 0.32$), and
with the approximation of the Comment with $\sigma$ calculated as in
[4], $\sigma^2 = \sum_{ij} \langle R_i^* \cdot R_j^* \rangle$
(o $T = 0.02$, O $T = 0.32$). Open squares are the quantity
$\sqrt{\sum_{ij} \langle R_i \rangle \cdot \langle R_j \rangle}$.
$s = 2$, $N = 100$, $t = t_0 = 10^4$, $10^2$ samples. INSET: $\sigma$
vs. $T$ for $d = 0.1$ and $d = 0.25$, for waiting time
$t \sim 10^6 \gg NT$.